## Scraped and parsed Home page from Internet Archive

Between 2012/01/01 to mid-2016

Data columns:-

'date', 'time', 'src', 'order', 'url', 'link_text', 'homepage_keywords', 'path', 'title', 'text', 'top_image', 'authors', 'summary', 'keywords'

Please note all columns from 'path' will be empty

|                  |   ('date', 'min') |   ('date', 'max') |   ('date', 'count') |
|:-----------------|------------------:|------------------:|--------------------:|
| ('fox', '2012')  |          20120101 |          20121231 |              218727 |
| ('fox', '2013')  |          20130101 |          20131231 |              159376 |
| ('fox', '2014')  |          20140101 |          20141231 |              377366 |
| ('fox', '2015')  |          20150101 |          20151231 |              237300 |
| ('fox', '2016')  |          20160101 |          20160805 |               59484 |
| ('hpmg', '2012') |          20120101 |          20121231 |              498477 |
| ('hpmg', '2013') |          20130101 |          20131231 |              727803 |
| ('hpmg', '2014') |          20140101 |          20141231 |             1414175 |
| ('hpmg', '2015') |          20150101 |          20151231 |              970848 |
| ('hpmg', '2016') |          20160101 |          20160806 |              501180 |
| ('nyt', '2012')  |          20120101 |          20121231 |              101405 |
| ('nyt', '2013')  |          20130101 |          20131231 |               91876 |
| ('nyt', '2014')  |          20140101 |          20141231 |              620250 |
| ('nyt', '2015')  |          20150101 |          20151231 |              708312 |
| ('nyt', '2016')  |          20160101 |          20160805 |              658245 |
| ('usat', '2012') |          20120101 |          20120929 |              144539 |
| ('usat', '2013') |          20130928 |          20131109 |                  10 |
| ('usat', '2016') |          20160408 |          20160718 |                 481 |
| ('wsj', '2012')  |          20120101 |          20121231 |              521638 |
| ('wsj', '2013')  |          20130101 |          20131231 |              452323 |
| ('wsj', '2014')  |          20140101 |          20141231 |              193664 |
| ('wsj', '2015')  |          20150101 |          20150208 |                1269 |